Manipulating treacheoesophageal speech
نویسندگان
چکیده
Speech therapy aiming at improving voice quality and speech intelligibility is often hampered by the lack of knowledge of the underlying deficits. One way to help speech therapists treating patients would be to supply synthetic benchmarks for pathological speech. These can be used to train therapists and evaluate and interpret automatic speech recognizers used for diagnosing pathological speech. Moreover, synthetic pathological speech can also be used to make expected therapy aims audible before treatment. In a listening experiment testing perceived intelligibility, three types of manipulations of tracheoesophageal speech were evaluated by experienced speech therapists. It was found that modeling the intensity contour of the voice source signal improved speech quality over plain analysis-synthesis. Replacing the voicing source with fully synthetic source periods decreased the perceived intelligibility markedly. Making the source fully periodic with a regular pitch had no effect on perceived intelligibility. Low quality speech benefitted more from manipulations, or deteriorated less, than high quality speech.
منابع مشابه
Manipulating speech pitch periods according to optimal insertion/deletion position in residual signal for intonation control in speech synthesis
This paper describes the investigation of manipulating positions in a speech pitch when lengthening or shortening the pitch period, that is, lowering or raising fundamental frequency of speech. The experimental results revealed that the preferable positions were at the first half of the pitch period for pitch shortening, and at the second half of it for pitch lengthening. The findings are expec...
متن کاملThe influence of selective attention to auditory and visual speech on the integration of audiovisual speech information.
Conflicting visual speech information can influence the perception of acoustic speech, causing an illusory percept of a sound not present in the actual acoustic speech (the McGurk effect). We examined whether participants can voluntarily selectively attend to either the auditory or visual modality by instructing participants to pay attention to the information in one modality and to ignore comp...
متن کاملThe Task-Dependence of Staged versus Cascaded Processing in Speech Production: An Empirical and Computational Study of Stroop Interference
We investigated the on-line relationship between overt articulation and the central processes of speech production. In two experiments manipulating the timing of Stroop interference in color naming, we found that naming behavior can shift between exhibiting a staged or cascaded mode of processing depending on task demands: an effect of Stroop interference on naming durations arose only when the...
متن کاملArticulatory controllable speech modification based on Gaussian mixture models with direct waveform modification using spectrum differential
In our previous work, we have developed a speech modification system capable of manipulating unobserved articulatory movements by sequentially performing speech-to-articulatory inversion mapping and articulatory-to-speech production mapping based on a Gaussian mixture model (GMM)-based statistical feature mapping technique. One of the biggest issues to be addressed in this system is quality deg...
متن کاملArticulatory controllable speech modification based on statistical feature mapping with Gaussian mixture models
This paper presents a novel speech modification method capable of controlling unobservable articulatory parameters based on a statistical feature mapping technique with Gaussian Mixture Models (GMMs). In previous work [1], the GMM-based statistical feature mapping was successfully applied to acousticto-articulatory inversion mapping and articulatory-to-acoustic production mapping separately. In...
متن کامل